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Nascent untranslated transcripts in 
bacteria are prone to generating 
RNA-DNA hybrids (R-loops); Rho- 
dependent transcription termination 
acts to reduce their prevalence. Here we 
discuss the mechanisms of R-loop forma- 
tion and growth inhibition in bacteria. 

Introduction and Historical 
Perspective 

The term R-loop refers to a nucleic acid 
structure that is comprised of single- 
stranded RNA base-paired with one strand 
of duplex DNA leading to displacement of 
the second (complementary) DNA strand. 
This nomenclature derives from the term 
"D-loop," which describes an intermediate 
structure that is formed during recombi- 
nation and is comprised of two strands of 
DNA base-paired with one another and a 
third displaced DNA strand. R-loops are 
comprised of one strand of RNA and two 
of DNA.' R-loops can occur either during 
replication, when DnaG (or an equivalent) 
primase synthesizes the RNA primer for 
Okazaki fragments,^ or from transcripts 
synthesized by RNA polymerase (RNAP). 
The focus of this article is limited to tran- 
scription-associated R-loops in bacterial 
cells. 

Historically, a role for R-loops in bac- 
terial physiology was first invoked by 
Tomizawa's group in the 1980s, when 
they showed that, during replication of 
the ColEl family of plasmids, the plas- 
mid-encoded RNA-II transcript forms an 
R-loop, which is cleaved by the enzyme 
RNase HI (that targets RNA exclusively 
in RNA-DNA hybrids) to generate the 
3'-OH end needed for initiation of DNA 



synthesis.^''' Subsequently, Kogoma's 
group provided evidence that E. coli cells 
deficient for either RNase HI or RecG (a 
helicase that is able to unwind R-loops) 
exhibit a phenomenon referred to as con- 
stitutive stable DNA replication (cSDR), 
in which transcription-generated R-loops 
mediate the establishment of replication 
forks for chromosomal DNA replication.' 
Kogoma and colleagues also showed that 
mutants defective for both RNase HI 
and RecG are inviable, from which they 
inferred that R-loops are constantly being 
generated in E. coli and are lethal unless 
they are removed.' Finally, since R-loops 
constrain DNA supercoils, their forma- 
tion may be expected to be promoted by 
increased negative supercoiling of DNA; 
work from Drolet's group'''' has convinc- 
ingly demonstrated that transcription- 
associated R-loop formation is a major 
contributor to sickness in topoisomerase 
I-deficient mutants of E. coli. 

R-loop Mechanisms: 
The Questions 

The studies mentioned above have pre- 
sented a fairly simple picture of R-loops: 
they are formed by the heteroduplexing of 
RNA transcripts with their cognate DNA 
template strands, and are toxic when they 
occur in excess. However, the devil, as 
they say, is in the detail, and the following 
questions are of interest in understanding 
R-loop mechanisms (see Fig. 1). 

1. How does an RNA transcript insert 
itself into duplex DNA to generate the 
R-loop? There are two dominant mod- 
els that are called the "bubble extension" 
and "re-annealing" models, respectively 
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Figure 1. Mechanisms and models for R-loop generation during bacterial transcription. Three models of R-loop formation are schematically depicted, 
as explained in the text; (A) bubble extension, (B) re-annealing of nascent or free transcripts and, (C) following dislodgement of backtracked RNAP. 
DNA, RNA and RNAP are shown in black, red and blue, respectively. Direction of transcription elongation complex movement is from left to right in all 
panels. 



(Some authors^ have also referred to them 
synonymously as the "extended hybrid" 
and "thread back" models, respectively). 
In the former (Fig. lA), the RNA syn- 
thesized within the transcription elon- 
gation complex remains associated as a 
persistent RNA-DNA hybrid, even as the 
complex continues to translocate cata- 
lytically along DNA (that is, the RNA 
does not undergo the normal process** 
of its separation from DNA, beyond the 
short stretch of 8 to 9 nucleotides at its 



3'-end, to emerge through the exit chan- 
nel of RNAP). Tomizawa's group^ had 
suggested this mechanism for formation 
of the persistent RNA-DNA hybrid from 
the RNA-II transcript during ColEl 
plasmid replication, but this was before 
the atomic structure of RNAP had been 
determined. In light of the structural 
information now available,* it is difficult 
to envisage how the mechanism to sepa- 
rate RNA from DNA within the enzyme 
can be bypassed. In the second model 



(Fig. IB), the R-loop is formed not as 
an extension of the transcription bubble 
but by the re-annealing of RNA with 
its cognate template DNA strand. More 
recently, a third model of R-loop forma- 
tion has also been proposed,'''" by which 
a longer RNA-DNA hybrid (about 25 bp) 
than that in a normal transcription bub- 
ble can be formed after dislodgement of a 
backtracked RNAP (Fig. IC); this model 
is further discussed in a subsequent sec- 
tion below. 
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2. In a re-annealing model of R-loop 
formation, how is the RNA strand inserted 
into DNA? Three mechanistic possibilities 
may be envisaged (Fig. IB): 3'-end inva- 
sion, 5 -end invasion, or insertion from the 
middle, of the RNA molecule into duplex 
DNA. Of these, the 3'-end invasion 
mechanism has been proposed to occur 
for cSDR, wherein the R-loop is believed 
to be formed much like the D-loop, with 
the 3 '-end of RecA-bound nucleic acid 
(RNA or DNA) invading into duplex 
DNA;' 5'-end invasion has been suggested 
as the mechanism for R-loop formation in 
Rho-deficient strains, as further discussed 
below. The third possibility, insertion 
of the middle of an RNA molecule into 
DNA, has been schematically depicted' 
as an apparent mechanism of R-loop 
generation in topoisomerase I-deficient 
E. coli, but it would require the action of 
a nicking-closing enzyme that can act on 
RNA-DNA hybrids (the existence of such 
an enzyme is currently unknown). On the 
other hand, an endonucleolytic cleavage 
within such RNA would enable R-loop 
formation by the 3'- or 5'-end invasion 
approaches. 

3. Further, in a re-annealing model, 
does the RNA substrate exist as a nascent 
transcript (that is, still bound to RNAP) 
or as a free transcript in the cytoplasm? 
For a nascent transcript, the 3'-end inva- 
sion possibility for R-loop generation will 
not apply since the RNA 3 '-end is held 
within the RNAP. Additionally, in accord 
with the twin-supercoiling domain con- 
cept of transcription elongation," it is 
the nascent transcript alone that will be 
associated with a region of negative super- 
coiling behind the moving RNAP, which 
could act to facilitate R-loop formation. 

4. Is R-loop formation RecA-dependent 
or -independent? More generally, are there 
factors that directly modulate the effi- 
ciency of R-loop formation? As mentioned 
above, the proposal for cSDR is that it is 
RecA-dependent.' 

5. Is the propensity for R-loop forma- 
tion dictated by nucleotide sequence? The 
answer would appear to be in the affirma- 
tive, with G-rich segments in the RNA 
being conducive for both the nucleation 
and extension of the RNA-DNA hybrid.'^ 

6. Finally, since transcription and 
translation are coupled to each other and 



occur in a single cytoplasmic compart- 
ment in bacteria, what is the effect (if any) 
of ribosomal engagement of the nascent 
and/or free transcripts on R-loop forma- 
tion? Drolet and colleagues'' have provided 
evidence that the frequency of R-loops 
is increased in topoisomerase I-deficient 
mutants when translation is inhibited, 
suggesting that ribosome-free transcripts 
are more prone to generating R-loops. 

R-loops and Rho-Dependent 
Transcription Termination 

It is in the context of these questions and 
models that we wish to discuss the find- 
ings of our recent paper" on the role of 
Rho-dependent transcription termination 
(RDTT) in the avoidance of excessive 
R-loops in E. coli. RDTT is a process''*'" 
in which the Rho protein binds a nascent 
unstructured transcript that is not being 
simultaneously translated, and then acts 
to terminate transcription. Another pro- 
tein that participates in RDTT is NusG, 
and both Rho and NusG are individually 
essential for viability in E. coli. 

We had earlier proposed"" '^ a unitary 
model in which: (i) nascent transcripts in 
bacteria are inherently prone to re-anneal- 
ing with upstream DNA to generate 
growth-inhibitory R-loops; (ii) two gen- 
eral mechanisms that act to preclude such 
R-loop generation are transcription-trans- 
lation coupling (for mRNAs of protein- 
coding genes) and secondary structure 
formation in RNA (e.g., in rRNA) and; 
(iii) when a nascent unstructured tran- 
script fails to be engaged by ribosomes for 
translation, RDTT acts to abrogate syn- 
thesis of the transcript so that the substrate 
for R-loop formation itself is not available. 
This model predicts that R-loops would 
occur at increased frequency in mutants 
that are deficient for RDTT; several indi- 
rect lines of genetic evidence have been 
obtained by us earlier in support of this 
prediction."''"* 

The results from our recent study" 
have shown conclusively that the essential 
role of RDTT is the prevention of exces- 
sive genome-wide R-loops in E. coli. One 
of the assays for detection of R-looped 
regions exploits the sensitivity to sodium 
bisulfite of the displaced non-template 
DNA strand that results in C-to-T changes 



in nucleotide sequence of the latter.^''^ 
With the aid of this assay, we could show 
that R-loops occur genome-wide at a basal 
level in wild-type E. coli, and that the fre- 
quency of their occurrence is elevated in 
a mutant deficient for RDTT.'^ Further, 
this study also established that excessive 
R-loops are indeed responsible for the 
lethality associated with complete absence 
of Rho or NusG in wild-type E. coli, since 
such lethality could be rescued by expres- 
sion of UvsW, an R-loop helicase from 
phage T4." 

Thus, the model that RDTT acts as 
a back-up measure to prevent the default 
occurrence of R-loops from nascent 
untranslated transcripts is strongly sup- 
ported by the new findings. This is 
important, not just because of the value 
of the model's intrinsic message, but also 
because many of the questions regarding 
R-loop mechanisms can now be more 
definitively answered (at least with respect 
to those R-loops whose formation is 
modulated by RDTT). For example, the 
following mechanisms suggested to oper- 
ate in R-loop formation are excluded in 
this model (since, in these cases, RDTT 
would not be able to contribute to reduced 
R-loop occurrence): bubble extension, 
3 '-end invasion, the involvement of free (as 
opposed to nascent) transcripts, and RecA- 
dependence. The last is excluded because 
our studies have indicated that R-loop 
formation in RDTT-defective mutants is 
equally feasible in both RecA-proficient 
and -deficient cells.''*'"' Furthermore, clues 
to the nature of nucleotide sequence pref- 
erences for R-loop generation in vivo are 
likely to be available from the bisulfite- 
sensitive hotspots that have been identified 
in the genome-wide mapping studies.'^ 

Given that Rho function is correlated 
with reduced R-loop occurrence in E. coli, 
is it possible that it does so, not by pre- 
venting their occurrence, but through its 
enzymatic activity as a 5 -3' RNA-DNA 
helicase" (that is, by unwinding R-loops 
which, under this scenario, could have 
arisen from nascent or free transcripts and 
by any of the three insertion mechanisms 
described above)? We think it unlikely 
because, if true, it would represent a role for 
Rho where it acts independently of NusG, 
since there is no evidence that NusG can 
directly stimulate Rho's helicase activity; 
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on the other hand, our results have shown 
that the increased occurrence of R-loops 
is manifest in both Rho- and NusG- 
deficient mutants.'^'"'''* 

Continuing with the assumption 
that R-loops detected by the bisulfite- 
sensitivity assay (even in wild-type cells) 
are a proxy for the occurrence of nascent 
untranslated transcripts in E. coli, we may 
infer that widespread low-level transcrip- 
tion from both DNA strands exists across 
the entire genome, and that this results 
in around 5% of the entire genome being 
R-looped." Such an inference is well sup- 
ported by the recent findings of Peters et 
al.^° for pervasive antisense transcription 
in E. coli that is suppressed by RDTT. 
These observations are reminiscent also of 
those from the ENCODE project,^' which 
have demonstrated that > 70% of the 
human genome is represented in primary 
transcripts of different cells. 

If excessive R-loops are lethal and 
RDTT acts to reduce their occurrence, 
why is Rho dispensable in some bac- 
teria, for example. Bacillus subtilis and 
Staphylococcus aureus''}'' One possibility 
is that in such bacteria, the enzymes for 
R-loop removal, such as RNase HI and 
RecG, are more active than they are in 
E. coli. 

R-loops, RNAP Backtracking, and 
Transcription-Replication Conflict 

In any discussion on R-loops, it is impor- 
tant to consider their inter-relationship 
with transcription-replication conflicts 
and with backtracked RNAP. Using 
E. coli mutants in which rRNA operon 
transcription was oriented head-on 
against replication fork direction, Michel 
and coworkers^^ showed that R-loops are 
generated at the collision loci, which are 
toxic unless they are removed; the toxicity 
is associated with replication fork stalling 
and/or collapse. Several other studies''"'^^ 
have also concluded that R-loops cause 
problems for replication, although 
Aguilera and colleagues^* have suggested 
that R-loop toxicity is associated with 
transcription elongation impairment. 

Evidence for the notion that RNAP 
backtracking is associated with R-loop 
formation comes from several different 
observations. First, the lethality of E. coli 



mutants lacking Rho or NusG is rescued 
by a mutation in RNAP that renders 
the enzyme resistant to backtracking.'^ 
Second, the toxicity of R-loop-associated 
transcription-replication conflicts in 
strains with inverted rRNA operons is 
also alleviated by an apparently similar 
kind of RNAP mutation. Finally, Dutta 
et al.' have shown that transcription that 
is not coupled to translation is associ- 
ated with RNAP backtracking as well 
as replication-dependent generation of 
DNA double-strand ends, with R-loops 
being implicated in the process, since it 
is reversed upon overexpression of RNase 
HI. 

It therefore appears that RNAP back- 
tracking and R-loop formation are inti- 
mately correlated; this notion is supported 
also by the model that the leading ribo- 
some on a nascent transcript is in physical 
contact (via NusG) with RNAP to pro- 
mote transcription elongation and to pre- 
vent the latter from entering a backtracked 
conformation.^''^'' Whether it is back- 
tracking that promotes R-loop formation, 
or R-loops that promote backtracking, or 
both, remains to be determined. A related 
question will be whether transcription- 
replication conflicts promote R-loops, 
or vice versa. The findings of Boubakri 
et al.^^ with the inverted rRNA operons 
do provide clear support for the former, 
whereas those of Dutta et al.' do the same 
for the latter. A model that envisages pro- 
motion of R-loop formation by backtrack- 
ing will explain our earlier finding'^ that, 
in mutants defective for RDTT, R-loops 
apparently provide a pathway of mRNA 
turnover that is an alternative to that cata- 
lyzed by RNase E. 

To explain their observations of R-loop 
mediated and replication-dependent tox- 
icity of backtracked RNAP, Nudler's 
group'''" has proposed a model (Fig. IC) 
in which (i) an RNA-DNA hybrid (up 
to 25 bp long) is formed (following dis- 
lodgement of a backtracked RNAP by a 
co-directional replication fork) through 
restoration of the register between the 
3'-end of the nascent transcript and DNA 
and; (ii) re-initiation of replication from 
the RNA 3'-end results in a residual nick 
in the leading strand that is converted 
into a double strand end upon passage of 
a subsequent replication fork through this 



region. This model represents the third 
possible mechanism of R-loop genera- 
tion (in addition to the bubble extension 
and re-annealing mechanisms described 
above). However, this model will not 
account for R-loops several hundreds of bp 
long that have been detected in both bac- 
teria and eukaryotes,*'^''^ as is also evident 
from our genome-wide bisulfite sensitivity 
studies.'^ We therefore suggest that the 
R-loops associated with RNAP backtrack- 
ing are generated by 5'-end invasion and 
re-annealing of nascent transcripts. 

Beneficial Effects of R-loops? 

Given that a basal level of genome-wide 
R-loops exists even in wild-type E. coli, 
the possibility should be considered 
whether, at this level of occurrence, they 
confer a beneficial effect to the cells. 
Thus, they may assist in the mechanisms 
of DNA damage repair, for example by 
serving to bridge double-strand breaks in 
the DNA or by priming replication restart 
through cSDR. It is interesting to note in 
this context that R-loop "annealases," that 
is, enzymes that promote the invasion of 
RNA into a DNA duplex, appear to exist 
in E. coliP 

R-loops in Eukaryotic 
Transcription 

Finally, the structure of RNAP as well as 
the processes of transcription initiation 
and elongation are quite well conserved 
across bacteria and eukaryotes.*'^' It is 
not unusual, therefore, that pathological 
R-loops occur also in eukaryotic cells, aris- 
ing from nascent transcripts that fail to be 
engaged by proteins in co-transcriptional 
processes, such as splicing, polyadenyl- 
ation or export from the nucleus into the 
cytoplasm.^" Physiologically, R-loops have 
been shown in cells of the immune system 
to participate in class switch recombina- 
tion and somatic hypermutation^ '^ and, in 
this case too, evidence for a re-annealing 
mechanism of R-loop formation has been 
obtained.^ 

Concluding Remarks 

In summary, transcription-associated 
R-loops are a feature of all living cells. 
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In E. coli, they apparently arise by 5'-end 
invasion and re-annealing of nascent 
untranslated transcripts across the entire 
genome, and the frequency of their occur- 
rence is kept in check by RDTT. Although 
R-loops do serve some physiological func- 
tions, such as during plasmid replication 
and cSDR in bacteria and for immune cell 
maturation in eukaryotes, for the most 
part they are pathological and give rise 
to transcription-replication conflicts and 
genome instability. We believe that the 
stage is now set for a more comprehensive 
understanding of how R-loops are gen- 
erated and of the consequences of their 
occurrence, in both bacteria and eukary- 
otic cells. 
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